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Method And Apparatus For Network Communication 

Technical Field 

The invention relates to a method and apparatus for 
network communication. In particular, it relates to a 
method and apparatus for tandemming network 
communications . 

Background 

Mobile telecommunications networks can choose between 
a large number of encoding and decoding schemes 
(codecs) for speech transmission. However, when two 
networks select different codecs (or different parts 
of the same network select different codecs) , then 
communications between those two entities requires 
tandemming . 

For example, a coding sequence between a CDMA (code 
division multiple access) mobile phone and a GSM 
(global system for mobile communication) mobile phone 
may be as follows: 

i. A CDMA mobile phone on a first network encodes 
speech with CDMA codec 1. 

ii. Codec 1 encoded speech is transmitted to a 
CDMA base station. 

iii. The CDMA Base station decodes the codec 1 
speech and encodes the result using PCM (pulse 
code modulation) . 

iv. The PCM encoded speech is transmitted via a 
wire-line to second, GSM, network. 

V. A GSM base station of the second network 

decodes the received PCM speech and encodes 
the result using GSM codec 2. 
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vi. Codec 2 encoded speech is transmitted to a GSM 
mobile phone on the second network. 

Thus in the above tandemming arrangement, the low 
5 bandwidth, high compression codecs used for wireless 
transmission are linked by a common high bandwidth, 
low compression PCM encoding scheme for the wireline 
part of the communication. 

10 However, the resulting end-user received speech tends 
to be of poor quality. The primary reason is that 
speech reconstructed from one high compression codec 
is generally not ideal as input to another high 
compression codec. Such codecs typically generate 

15 high-level parameterisations of the speech with 
minimal redundancy, with the result that the 
reconstructed speech used by the PCM contains 
regularities and approximations not found in the 
original. A second codec seeking to generate a 

20 slightly different set of high-level 

parameterisations will find that the salient 
characterising information it assumes to be present 
has been removed or just interpolated by the first 
codec. The result is a poor representation of the 

25 speech by the second codec. 

Currently, the concept of tandem-free operation (TFO) 
addresses this problem (see ETSI, ^^Technical 
Specification Digital cellular telecommunications 
30 system (Phase 2+) ; Universal Mobile 

Telecommunications System (UMTS); Inband Tandem Free 
Operation (TFO) of speech codecs; Service 
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description; Stage 3 (3GPP TS 28.062 version 5.3.0 
Release 5)" ETSI TS 128 062 V5 . 3 . 0 (2002-12)). 

However, it only does so if the two networks have the 
5 same codec available. That is, the same access 

technology or compatible (e.g. between AMR (adaptive 
multi-rate) capable GSM networks and 3GPP (third 
generation partnership project) networks), and 
additionally only if end-to-end negotiation on call 
10 set-up is possible. 

Thus it is not applicable when dissimilar codecs are 
used or when end-to-end negotiation is not possible 
or not implemented. 

15 

Dilithium Networks also provide a solution to the 
problems raised by tandemming, known as Unicoding™. 
( http : //www . dilithiumnetworks . com/technology/voice . ht 
m) 

20 

This solution requires that one of three alternatives 
be pursued: Either the first codec's data is conveyed 
to the second network prior to translation to it's 
codec format, or that the data is translated in the 
25 first network to the second codec's format before 
being sent to the second network, or that the data 
from the first codec is routed to a proxy server to 
perform the translation and then routed from the 
proxy server to the second network. 

30 

Referring to FIG. 1, Unicoding employs CELP (code 
excited linear predictive) codec parameter 
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translation from one codec data format 110 to another 
130 and requires dedicated translation modules 120, 
130 to be available for all possible codec to codec 
permutations . 

5 

This is not a simple solution however as, for 
example, just for 3GPP2 to GSM networks this would 
require Unicoding translation modules to be available 
to and from each of the four 3GPP2 codecs (IS-733, 

10 IS-96A, EVR (enhanced variable rate) and SMV 

(selectable mode vocoder) ) to and from each of the 
three GSM codecs (Full-Rate, Half-Rate and AMR 
including EFR (enhanced full rate) ) . These twelve 
permutations are then further compounded by the 

15 multiple available modes for SMV (2 or 3 likely 

deployment modes) and the 10 modes of AMR, increasing 
the permutations to 60 or 72. Whilst there would be 
significant commonality between many of these, the 
problems of developing and deploying a large number 

20 of Unicoding translation modules over a number of 
networks, and the process of redeployment upon the 
introduction of any new codecs makes the solution 
appear unwieldy. 

25 Many of the principles applied in the Dilithium 

Networks solution can also be found in H-G. Kang, H. 
K. Kim & R. V. Cox, ''Improving Transcoding Capability 
of Speech Coders in Clean and Frame Erasured Channel 
Environments," Proceedings of the 2000 IEEE Workshop 

30 on Speech Coding, 2000. 
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There appears to still be a need for an alternative 
method of tandem communication that provides both 
improved voice quality and a simple means of 
operation across one or more networks. 

5 

The purpose of the present invention is to address 
the above problems. 

Summary of the Invention 
10 The present invention provides a method of tandem 

communication between at least a first portion of a 
network suitable for voice communications and a 
second portion of a network suitable for voice 
communications . 

15 

In a first aspect, the present invention provides a 
method of tandem communication, as claimed in claim 
1. 

20 In a second aspect, the present invention provides a 
method of tandem communication, as claimed in claim 
8. 

In a third aspect, the present invention provides 
25 apparatus for tandem communication, as claimed in 
claim 12 . 

In a fourth aspect, the present invention provides 
apparatus for tandem communication, as claimed in 
30 claim 13. 
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Further features of the present invention are as 
defined in the dependent claims. 

Embodiments of the present invention will now be 
described by way of example with reference to the 
accompanying drawings, in which: 

Brief description of the drawings 

FIG. 1 is a block diagram showing a tandem 

communication method in the prior art. 

FIG. 2 is a block diagram showing a tandem 
communication method in accordance with an embodiment 
of the present invention. 

Detailed description 

Referring to FIG. 2, a method of tandem communication 
is proposed between at least a first portion of a 
network suitable for voice communications and a 
second portion of a network suitable for voice 
communications. These portions may be parts of the 
same or separate networks. 

The inventors of the present invention have 
appreciated that an alternative to low-compression, 
high-bandwidth PCM speech coding may be employed that 
obviates the need for decoding data from a first 
codec into PCM speech, and then re-encoding it using 
a second codec. 

This alternative, named the common compressed voice 
format (CCVF) , is a common data format intended to 
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take advantage of the fact that the majority of 
codecs currently in use are of the CELP variety. The 
CCVF provides a common form of representation for any 
such codec that it supports. 

5 

The CCVFs common form of representation allows a 
single, complex translation module 220, 230 to take 
in the CCVF representation and output the relevant 
second codec representation without the combinatorial 
10 problems experienced by the separate translation 
modules of the Dilithium Unicoding^" solution. 

Moreover, when a network provider introduces a new 
codec, the onus is solely on that network provider to 

15 update their own version of the CCVF encoder 210 and 
converse translation module 220; Other networks will 
be able to use the CCVF representation so produced 
without modification, and naturally only the network 
provider itself requires CCVF translation back to the 

20 new codec. This greatly simplifies the deployment and 
maintenance of a tandemming solution. 



Thus, in an embodiment of the present invention, a 
common data format is applied to an encoded signal 
25 that has been produced by a codec of the first 

portion of a network (hereinafter ^first codec'). 

The CCVF encoder 210 applying the common data format 
comprises means for quantising parameters of the data 
30 encoded by a codec, as well as means for describing 
the type of codec scheme used to encode the data. 
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Thus upon application to a first codec, the common 
data format comprises quantised parameters of the 
encoded signal produced by the first codec and 
descriptors characterising the coding scheme of the 
first codec. 

Specifically, the common data format may describe any 
or all of the following coding scheme 
characteristics ; 

i. The type of quantisation format for LPC 
{linear predictive coding) sets; 

ii. The number of LPC sets quantised per frame; 

iii. The order of the LPC 

iv. The number of LPC interpolations per frame; 
V. The type of interpolation rules for the LPCs; 

vi. The number of sub-frames per frame for LTP 
updates; 

vii. The number of sub-frames per frame for 
codebook updates; 

viii. The type of pitch sharpening present^ if any; 
and 

ix. The type of codebook encoding format. 

Similarly, the common data format may include any or 
all of the following encoded signal parameters; 

i. LPC vector; 

ii. lag durations; 

iii. LTP gain; 

iv. pitch sharpening coefficient; 
V. fixed codebook components; and 
vi . codebook gains . 
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The above characteristics typically apply to CELP 
based codecs, but it will be clear to a person 
skilled in the art that these lists can be altered to 
include other encoded signal parameters and coding 
5 scheme characteristics as necessary. 

To illustrate the CCVF encoder 210, a specific 
example is given below for encoding GSM EFR: 

10 Descriptors for coding scheme characteristics: 

Descriptor No. bits 

used 

Quantization format of LPC sets = 
15 2 

Number of LPC sets quantized per frame 
2 

Number of LPC Interpolations per frame 
3 

20 Interpolation rules for the LPCs 
4 

Number of subframes per frame for LTP updates 
3 

Number of subframes per frame for Codebook updates= 

25 3 

Pitch Sharpening Present 
1 

Codebook Encoding format = 
3 

30 

Quantised parameters of the encoded signal: 



9 



CML 01372EV/GBRI/ DRE / G i bb s 



Parameter No. bits 

used 

LPC Vector Quantised 2x frame @ 50 bits -> 2x50 = 
100 

5 Lags 4x frame @ 10 bits -> 4x10 = 

40 

LTP Gain 4x frame @ 8 bits -> 4x8 

36 

Pitch Sharpening Coefficient 4x frame -> 4x8 
10 36 

(Optional in GSM EFR) 

Ternary Coded Excitation (160 samples) -> 254 = 
254 

(Losslessly coded) 
15 Codebook Gains 4x frame @ 10 bits -> 4x10 = 

40 



Total Bits 

498 

20 (Bit rate of 24.9 

kb/s.) 

Thus the format of the GSM EFR scheme is described 
using a small number of bits to specify coding scheme 
details such as the use of 2 LPC vectors per frame, 

25 or that the codebook format is ternary coded 

excitation. The coding scheme details need not be 
transmitted with every frame, though they can be if 
desired. Typically the various parameters of the 
encoded signal are quantised with little or no 

30 compression. 
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Note that in theory it would be desirable for the 
CCVF formatting to be able to accommodate all 
possible parameterisations and descriptors. Whilst 
the characteristics of current and likely CELP codecs 
5 may easily be anticipated and included, clearly any 
codec representing a significant departure from the 
CELP model may necessitate the introduction of an 
updated CCVF. This in turn would necessitate an 
update of all translation software across networks. 

10 However, the inventors of the present invention 
consider that this would be an infrequent event. 
Moreover, such changes to the CCVF may be avoided by 
using the parameterisations and descriptors currently 
included as part of the CCVF to encode the synthetic 

15 speech from the first codec, though clearly this 
encoding-translation will be sub-optimal. 

The resulting CCVF representation of the encoded 
speech is transmitted over a wireline (landline) in 
20 lieu of a PCM signal. The wireline may further be 
part of a public switched telephone network or a 
packet switched network. 

A second portion of a network, the second portion 
25 either being part of the same overall network as the 
first portion, or part of a separate network, then 
receives the CCVF representation. 

A base station or other suitable apparatus 220, 230 
30 within the second portion then performs codec 

parameter translation from the common compressed 
voice format 210 into an encoded signal compatible 
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with a second codec format supported by the second 
portion of the network, requiring dedicated code to 
be available for the supported codec. 

5 If the second codec is the same as the first codec, 

the step of translation simply comprises dequantising 
the common data format representation and 
substantially reconstituting the original encoded 
signal . 

10 

If the second codec is different to the first codec, 
the step of translation comprises dequantising the 
common compressed voice format representation and 
applying a conversion process to convert components 
15 of the encoded signal produced by the first codec 
into components compatible with the second codec. 

For example, the first codec may represent line 
spectral frequencies (LSFs) using a first 

20 quantisation scheme; the conversion process 

dequantises the LSFs according to the first scheme, 
and then quantises them according to a scheme of the 
second codec. By this method the second codec obtains 
parameters of the original speech without the 

25 inherent problems of encoding reconstructed speech 
discussed previously. 

The main benefit of this approach over the prior art 
is that the translation is from a single common data 
30 format (the common compressed voice format), rather 

than from a disparate group of codecs. By decomposing 
individual codecs into their constituent parameters 
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in a prescribed fashion, the CCVF removes the need to 
treat each codec as an individual entity and so 
avoids the combinatorial problems seen in the prior 
art . 

5 

In an embodiment of the present invention, apparatus 
210 for tandem communication comprises application 
means to apply a common data format to an encoded 
signal produced by a codec. 

10 

The apparatus 210 implementing the application of the 
common data format comprises means for quantising 
parameters of the data encoded by a codec, as well as 
means for describing the type of codec scheme used to 
15 encode the data. 

In an embodiment of the resent invention, apparatus 
220, 230 for tandem communication comprises 
conversion means for converting a common data format 
20 representation of an encoded signal into an encoded 
signal compatible with at least a first specific 
codec . 
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